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ABSTRACT 

This  paper  presents  a  novel  panning  algorithm  called  Speaker-Placement  Correction  Amplitude  Panning  (SPCAP) 
which  guarantees  conservation  of  loudspeaker  power  output.  The  method  is  appropriate  for  any  speaker 
arrangement  (e.g.  ITU  5.1,  10.2,  etc.),  and  scales  with  the  number  of  speakers.  SPCAP  works  by  correcting  initial 
pan  values  based  on  speaker  placement  to  achieve  constant  power  output.  Because  panning  occurs  over  an  arbitrary 
number  of  speakers  (i.e.  is  not  pair-wise),  SPCAP  provides  two  significant  advantages  over  discrete  panning 
schemes.  First,  pan  values  for  current  and  future  surround-sound  formats  (e.g.  5.1  and  10.2)  are  guaranteed  to 
conserve  power  under  any  lower-resolution  setup,  making  dynamic  up/down  mixing  in  non-standard  setups  feasible. 
Second,  SPCAP  provides  a  framework  for  producing  wide  (non  point-source)  sounds. 


1.  INTRODUCTION 

Recent  work  on  multichannel  pan-pots  has  focused  on 
particular  loudspeaker  configurations  (e.g.  ITU  5.1  and 
10.2)  [1]  [2],  Diverse  areas  such  as  virtual  reality  and 
computer  music  require  panning  methods  that  allow 
reasonable  performance  on  both  standard  and  non¬ 
standard  loudspeaker  configurations.  This  work 
addresses  the  need  for  such  flexible  pan-pot  generation. 


The  VBAP  method  presented  in  [3]  was  the  first  to 
generate  layout-independent  pan-pots. 

We  have  created  a  framework  for  designing  pan-pots 
that  are  independent  of  loudspeaker  configuration.  The 
contributions  of  this  work  are  a  set  of  pan-pot  rules 
guaranteed  to  conserve  power  under  rotation  and  a 
framework  for  rendering  wide  (rather  than  point-source) 
sounds.  In  broad  terms,  the  method  works  by 
determining  the  significance  of  each  loudspeaker  to  the 
total  output.  By  weighting  each  loudspeaker 
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appropriately,  SPCAP  computes  the  number  of  effective 
speakers  per  channel.  Any  power  loss  is  tracked;  the 
lost  power  is  then  added  back  into  the  system  and 
distributed  according  to  each  channel’s  contribution  to 
the  total  power  output.  It  can  therefore  be  seen  that 
power  output  is  constant  under  rotation. 

2.  BACKGROUND 

Researchers  have  sought  optimal  pan-pot  laws  since  the 
beginning  of  stereophony  [5].  Though  the  most  popular 
and  well-known  pan-pot  is  pair-wise  amplitude  panning 
[1],  there  exist  several  panning  methods  that  are  not 
pair-wise,  most  notably  [1]  [2]  [3]  [4],  Pulkki's  VBAP 
is  the  method  most  pertinent  to  the  one  presented  here. 
Both  VBAP  and  our  method  generate  multi-speaker 
pan-pots  dynamically,  based  on  loudspeaker  position 
information.  However,  the  central  aims  of  the  two 
methods  differ:  while  VBAP  aims  to  produce 
maximally  sharp  images,  our  method  is  specifically 
targeted  to  producing  wide  sounds.  The  VBAP  method 
aims  to  generalize  pair-wise  panning  to  three 
dimensions.  While  VBAP  is  the  basic  inspiration  for 
our  method,  we  endeavor  to  overcome  three  problems 
of  VBAP: 

(i)  VBAP  requires  interpolation  between  source 
positions  to  avoid  disturbing  effects  [3] 

( ii j  Because  of  (i),  power  output  is  not  constant 

during  interpolation  [3]. 

(iii)  Implementing  spread  as  described  in  [3]  is 
costly,  being  linearly  proportional  to  desired 
source  width  and  speaker  number. 

The  first  two  problems  may  be  overcome  within  the 
VBAP  framework  by  computing  the  proper  gain  values 
at  each  time-step.  Problem  (iii)  is  more  fundamental 
than  (i)  or  (ii). 

3.  METHOD 

Speaker-Placement  Correction  Amplitude  Panning 
(SPCAP)  is  a  novel  panning  algorithm  for  3D  audio. 
This  method  extends  the  notion  of  cosine-weighted 
panning  by  correcting  gain  values  according  to  source 
and  loudspeaker  locations.  By  weighting  each 
loudspeaker  channel  appropriately,  SPCAP  computes 
the  number  of  effective  speakers  per  channel.  It  then 
computes  appropriate  gain  values  in  two  passes  over  the 
speaker  array.  The  first  pass  achieves  gain 


normalization  (Section  3.1)  and  the  second  pass  ensures 
power-conservation  (Section  3.2).  See  Figure  1.  For 
clarity  all  equations  assume  that  a  listener  is  located  at 
the  origin,  surrounded  by  speakers  on  a  unit  sphere.  1 

3.1.  Gain  Normalization 

We  compute  the  initial  pan  values  using  a  normalized 
cosine-weighting  based  on  the  angle  between  the  virtual 
source  and  each  loudspeaker.  In  an  ITU  setup,  for 
example,  a  source  at  zero  degrees  would  result  in  a  pan 
value  of  1 .0  in  the  center  channel.  So  we  have  simply 


W)=y[1+C  a). 

where  P  is  the  initial  pan-value  assigned  to  the  i"' 

loudspeaker,  9  its  placement  angle  and  0  is  the 
direction  of  the  virtual  source. 

The  pan  values  j  P }  vary  with  speaker  placement. 

Normalizing  by  the  number  of  effective  speakers  per 
loudspeaker  removes  this  dependency.  Consider  two 
co-located  loudspeakers.  If  each  loudspeaker  receives  a 
pan  value  as  given  by(l ),  the  power  from  their  direction 
is  twice  as  high  as  appropriate.  By  viewing  this  pair  as 
the  number  of  effective  speakers  it  represents,  we  can 
reduce  their  power  by  a  factor  of  two,  achieving  the 
correct  gain  value  from  their  direction. 

We  compute  the  effective  number  of  speakers  J3.  for  a 

given  speaker  i  as  a  sum  of  the  cosines  of  the  angles 
between  i  and  each  loudspeaker  and  j.  Since  this 
includes  the  zero-angle  between  each  speaker  and  itself, 
there  is  always  at  least  one  effective  speaker. 

So  we  have, 

#=Z^[1+cos(6>-0)] 

j= o  1 


1  Allowing  speakers  beyond  the  unit  sphere  and 
allowing  the  listener  to  move  away  from  the  origin  are 
goals  for  our  future  work. 
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where  0 , 9  are  the  placement  angles  of  the  i‘h  and  j,h 
speakers,  respectively. 

Once  the  effective  speaker  weights  have  been 
computed,  we  determine  an  initial  gain  value  Gi  via 

G,(S.)=^  (3). 

Dividing  each  pan  value  P.  by  the  corresponding  /?  , 

we  normalize  each  G(  against  the  speaker  distribution. 

However,  this  is  not  sufficient  to  guarantee  that  power  is 
conserved  under  rotation. 

3.2.  Power  Conservation 

Since  power  varies  inversely  with  distance-squared  2 
[7],  we  can  determine  the  amount  of  power  the 

loudspeaker  array  should  emit.  The  emitted  power  P 
is  given  by  the  sum  of  the  squared  gains. 

P,(e)=±(G:(e)f  (4) 

1=1 

Yet  the  correct  power  L  is  inversely  proportional  to 
distance-squared,  yielding 


where  k  is  a  constant 3.  Since  amplitude-panning  alone 
is  not  able  to  represent  sounds  within  the  boundary 
formed  by  the  loudspeakers,  there  is  a  minimum 
allowed  distance  between  the  virtual  source  and  the 
listener.  We  suppose  that  the  minimum  distance  is  the 
radius  of  the  unit-sphere  upon  which  the  loudspeakers 
are  located.  Therefore,  when  a  source  is  located  on  the 


2  We  make  the  simplifying  assumption  that  the  virtual 
source  represents  an  omnidirectional  (monopole) 
emitter. 

3  The  exact  value  of  k  is  not  needed,  since  we  aim  only 
to  produce  constant  power,  not  replicate  the  power  of  a 
real  source.  Therefore,  we  take  k  =1. 


unit-sphere,  its  power  output  should  be  maximal,  (i.e. 
d  =  1  and  L  =  k  )  . 

Since  we  know  the  contribution  of  each  i'1  loudspeaker 
we  can  compute  the  ratio  of  each  loudspeaker’s  output 
to  the  total  power  as 


Dividing  L  into  portions  {  y }  dictated  by  [  Rt  }  shows 
how  much  power  should  go  to  each  i,h  loudspeaker. 

Thus  we  define  {^.J  in  Equation  (7). 

Yi  =  RL  (7) 

By  definition,  ^  R)  =  1 .  Therefore  ^  y ■  =  L  ,  which 

means  that  using  powers  {  yi  j  ensures  constant  power 
output  from  the  speakers.  The  final  gain  values 
{  Aj }  are  thus 

A=yfiA  (8k 

meaning  that  power  is  constant  for  all  source  angles. 


Figure  1 :  8-channel  gains  yielding  constant  power. 
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4.  DISCUSSION:  BENEFITS  OF  SPCAP 

The  SPCAP  method  as  formulated  above  determines  n 
gain  values  -  one  per  loudspeaker  -  that  yield  constant 
power  during  source  rotations.  Development  of  this 
method  was  in  part  motivated  by  the  need  to  overcome 
three  shortcomings  of  VBAP  as  described  in  Section  2. 
Problems  ( i)  and  (ii)  are  solved  by  the  constant  power 
guarantees  of  SPCAP. 

By  raising  the  cosine  term  in  (1)  to  a  power  OC  ,  SPCAP 
can  control  tightness ,  or  how  steeply  the  signal  dies  off 
relative  to  angle  from  the  source  as  in  Figure  2.  This 
control  allows  SPCAP  to  achieve  the  same  effect  as 
VBAP  to  approximate  non-point-source  sounds.  In  [3], 
Pulkki  refers  to  this  effect  as  spread. 

Pi{0s)  =  7L[l  +  c°s(0i-0)]a  (9) 


tightness,  it  does  not  capture  width  because  the  signals 
are  correlated. 


Figure  3:  8-channel  Gains  with  moderate  CX,  value. 


Because  the  runtime  complexity  of  SPCAP  is 
determined  by  the  number  of  speakers  (i.e.  complexity 
O (n)  ),  it  is  not  a  function  of  desired  source  width. 
Therefore,  SPCAP  also  overcomes  problem  (Hi),  as 
listed  in  Section  2. 

Power  =  1 


Tightness 

Figure  2  :  Gains  with  moderate  and  high  values  for  OC 

Using  a  as  a  function  of  distance  has  the  additional 
property  that  as  sounds  move  away  from  the 
loudspeakers,  they  become  closer  to  point-sources  [6]. 
Figures  2  and  3  illustrate  the  effect  of  raising  OC  . 

The  flexibility  of  the  SPCAP  method  provides  a 
framework  for  our  ongoing  research  into  rendering  truly 
wide  sounds.  Like  VBAP,  our  method  splits  an  input 
signal  into  an  arbitrary  number  of  output  signals.  While 
this  approach  creates  a  vague  sense  of  spread  or 


Figure  4  :  8-channel  gains  with  center  channel  displaced 
15  degrees. 

However,  the  values  j  R  }  in  SPCAP  provide  insight 

into  how  much  each  channel  contributes  to  the  output. 
This  information  could  be  used  to  inform  a  de- 
correlation  process  which  will  properly  widen  the 
sound.  Determining  these  processes  is  the  aim  of  our 
current  research. 

The  SPCAP  approach  has  several  additional  advantages 
over  discrete  panning  schemes.  First,  tills  method 
provides  a  framework  in  which  content  distribution  is 
abstracted  from  delivery  mechanism.  That  is,  pan 
values  in  SPCAP  are  extracted  from  3D  virtual  source 
positions,  which  do  not  depend  on  the  configuration  of 
the  playback  system.  Therefore,  sessions  created  using 
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one  setup  map  nicely  to  another  -  possibly  very 
different  -  setup.  Content  mixed  on  an  ITU  setup,  for 
example,  will  mix-down  to  4. 1  or  two-channel  system  in 
a  reasonable  way.  Conversely,  SPCAP  will  up-mix  a 
session  arranged  on  a  4.1  system  in  order  to  exploit  the 
additional  channels  of  a  high-resolution  setup  (e.g. 
10.2).  In  particular,  SPCAP  gracefully  handles  non¬ 
standard  setups  because  it  performs  the  gain 
determination  and  power-conservation  steps  on  a  per- 
speaker  basis.  For  example,  if  logistical  considerations 
demand  a  setup  with  no  center  channel,  SPCAP  will 
compensate,  diverting  power  to  the  nearest  speakers. 
Figure  4  shows  the  gains  for  an  8-channel  setup  where 
the  center  channel  has  been  displaced  by  15  degrees. 


Note  that  the  SPCAP  method  does  not  place  restrictions 
on  the  criteria  for  assigning  initial  gains  or  effective 
weights.  The  only  requirements  are  as  follows: 


a) 


<1, 


b)  this  ratio  varies  smoothly  and  continuously  as 
the  source  rotates. 


In  this  formulation,  we  use  cosines  in  equations  (1)  and 

(2)  because  we  desire  global  support  to  facilitate  our 
research  on  rendering  wide  sounds.  Applications  that 
do  not  desire  global  support  could  replace  (1)  with  a 
limited-support  function.  Similarly,  applications  that 
seek  to  use  panning  criteria  other  than  angle  (taking  into 
account  psycho-acoustic  phenomena,  for  example) 
should  replace  the  weighting  function  in  (2)  with  a  more 
suitable  function. 


5.  CONCLUSION 

We  have  presented  a  method  called  SPCAP  for  dynamic 
pan-pot  generation.  While  SPCAP  takes  inspiration 
from  VBAP  [3]  it  offers  a  number  of  significant 
contributions.  First,  it  guarantees  conservation  of 
power,  independent  of  the  playback  setup.  This  feature 
allows  for  dynamic  re-mixing  of  source  material. 
Second,  it  offers  a  framework  for  our  research  in 
rendering  wide  sounds.  Finally,  SPCAP  is 
computationally  inexpensive.  For  these  reasons  SPCAP 
has  many  applications  in  areas  as  diverse  as  content 
distribution,  long-distance  collaborative  music,  or 
computer  games.  The  method  is  particularly  well  suited 
to  tasks  in  which  numerous  mono-sounds  are  to  be 
mixed  dynamically  (e.g.  videogames  and  computer 
music)  or  to  create  and  deliver  content  to  venues  with 
non-standard  audio  setups. 
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